Detection of Nested Mentions for Coreference Resolution in Polish
نویسندگان
چکیده
This paper describes the results of creating a shallow grammar of Polish capable of detecting multi-level nested nominal phrases, intended to be used as mentions in coreference resolution tasks. The work is based on existing grammar developed for the National Corpus of Polish and evaluated on manually annotated Polish Coreference Corpus.
منابع مشابه
Evaluation of Coreference Resolution Tools for Polish from the Information Extraction Perspective
In this paper we discuss the performance of existing tools for coreference resolution for Polish from the perspective of information extraction tasks. We take into consideration the source of mentions, i.e., gold standard vs mentions recognized automatically. We evaluate three existing tools, i.e., IKAR, Ruler and Bartek on the KPWr corpus. We show that the widely used metrics for coreference e...
متن کاملCorefrence resolution with deep learning in the Persian Labnguage
Coreference resolution is an advanced issue in natural language processing. Nowadays, due to the extension of social networks, TV channels, news agencies, the Internet, etc. in human life, reading all the contents, analyzing them, and finding a relation between them require time and cost. In the present era, text analysis is performed using various natural language processing techniques, one ...
متن کاملCombining Syntactic and Semantic Features by SVM for Unrestricted Coreference Resolution
The paper presents a system for the CoNLL2011 share task of coreference resolution. The system composes of two components: one for mentions detection and another one for their coreference resolution. For mentions detection, we adopted a number of heuristic rules from syntactic parse tree perspective. For coreference resolution, we apply SVM by exploiting multiple syntactic and semantic features...
متن کاملPolish Coreference Corpus in Numbers
Abstract This paper attempts a preliminary interpretation of the occurrence of different types of linguistic constructs in the manually-annotated Polish Coreference Corpus by providing analyses of various statistical properties related to mentions, clusters and near-identity links. Among others, frequency of mentions, zero subjects and singleton clusters is presented, as well as the average men...
متن کاملDetección de menciones anidadas basada en expansión para el español
Mention detection is the first module used in coreference resolution systems. Due to that, it is important that the results obtained by this module are as high as possible. Within the field of mention detection, nested mentions are the most difficult ones to detect. In this paper, we present a nested mention detection system based on expansion, a new model for detecting nested elements in NLP b...
متن کامل